The origin of the Langevin equation and the calculation of the mean squared 
displacement: Let's set the record straight 
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Ornstein and his coauthors, who constructed a dynamical theory of Brownian motion, taking the 
equation mdv/dt = — (v + X as their starting point, usually named the equation after Einstein alone 
or after both Einstein and Langevin; furthermore, Ornstein, who was the first to extract from this 
equation the correct expression for A 2 , the mean-squared distance covered by a Brownian particle, 
credited de Haas-Lorentz, rather than Langevin, for finding the stationary limit of A 2 . A glance at 
Einstein's 1907 paper, titled "Theoretical remarks on Brownian motion", should suffice to convince 
one that it is not unfair to attribute the conception of the above equation, now universally known as 
the Langevin equation, to Einstein. Langevin's avowed aim in his 1908 article was to recover, through 
a route that was 'infinitely more simple', Einstein's 1905 expression for the diffusion coefficient, but 
a careful reading of Langevin's paper shows that — depending on how one interprets his description 
of the statistical behavior of the random force X appearing in the above equation — his analysis 
is at best incomplete, and at worst a mere tautology. Since textbook accounts are based on the 
interpretation that renders the proof fallacious, alternative derivations, which are adaptations of 
those given by de Haas-Lorentz and Ornstein, are presented here. Some neglected aspects of the 
contents of Ornstein's early papers on Brownian motion are also brought to light. 



1. INTRODUCTION 

It has become customary 1 to mark the birth of stochas- 
tic differential equations with the publication of the 1908 
piece 2 ' 3 by Langevin — because he was the first to inscribe 
such an equation. It is equally common to regard Orn- 
stein as the principal architect of the modern theory of 
Brownian motion — because he showed, in a series of semi- 
nal papers, 4-9 how one may extract physically significant 
information from what is now called Langevin's equation. 
Yet it does not appear to be widely known that, in nearly 
all his papers, Ornstein named the underlying stochastic 
equation after Einstein alone or after both Einstein and 
Langevin. 

Though the question "Did Ornstein have any grounds 
for associating the archetypal stochastic equation with 
Einstein?" will be answered in this article, it will not oc- 
cupy center-stage. My principal purpose is rather to draw 
attention to an unsatisfactory feature of Langevin's os- 
tensibly simple derivation of an expression for the mean- 
squared displacement. One of his propositions (that re- 
lating to the vanishing of the average value of the product 
of the particle position x and the random force X), on 
which the entire argument hinges, is offered without any 
proof; the singular ease with which the final result is ar- 
rived at provokes unease, rather than approval, when one 
notices that the average values of vX and xv, two closely 
related products involving the velocity v, cannot vanish, 
but they must do so according to the reasoning employed 
by Langevin. 

I argue here that Langevin's logic is seriously flawed, 
and its continued presentation to beginners no better 



than the dispensation of a nostrum to an unsuspecting 
patient. A demolition plan, such as that proposed here, 
stands a better chance of adoption if it is accompanied by 
a constructive suggestion; accordingly, I also present an 
alternative derivation, inspired by the edifice constructed 
by Ornstein and co-authors; though not as short as the 
original Langevin version, it is logically sound, and yields 
much greater insight. I also draw attention to an alter- 
native treatment, due originally to de Haas-Lorentz, 10 
who may be called a forerunner of Ornstein, and to some 
unaccountably neglected aspects of Ornstein's work. 



*K. Razi Naqvi: razi.naqvi@phys.nt nu.no| 



2. LANGEVIN, HIS FORERUNNERS, 
CONTEMPORARIES AND AFTERCOMERS 



Using different approaches, Einstein 11 and 
Smoluchowski 12 had arrived, shortly before Langevin 
turned his attention to the issue, at discordant ex- 
pressions for the diffusion coefficient D of a spherical 
particle. Langevin lodged two claims at the start of 
his analysis: "I have been able to determine, first of 
all, that a correct application of the method of Mr. 
Smoluchowski leads one to recover the formula of Mr. 
Einstein precisely, and, furthermore, that it is easy to 
give a demonstration that is infinitely more simple by 
means of a method that is entirely different" . [Emphasis 
in the original] 

It will be well to pause at this point and recall what 
Einstein wrote in his 1907 paper. I recapitulate, in the 
next two paragraphs, his statements after some condensa- 
tion and a modicum of paraphrasing, but without chang- 
ing his notation. 
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2.1. Einstein's 1907 paper 

If we knew nothing of the molecular theory of heat, 
Einstein wrote, 13 we should expect the following to hap- 
pen. Suppose that we impart to a particle suspended 
in a liquid a certain velocity applied to it from without; 
then the particle will come to a halt as a result of the 
friction of the liquid. If we ignore the inertia of the latter 
and assume that the resistance experienced by a particle 
moving with velocity v is 6TrkPv, where k is the viscosity 
of the liquid and P the radius of the particle, we obtain 
the equation 



Tfl—^- = —6TTkPv. 

dt 



(1) 



From this it follows that the velocity will fall to a tenth 
of its original value in a time 9 = In 10 m/(67r/cP); for a 
spherical particle of radius 2.5 /zm suspended in water, 9 
comes out to be 330 ns. 

If we take into consideration the molecular theory of 
heat, Einstein reminded the reader, we have to modify 
this conception. We must continue to assume, as before, 
that the particle will nearly completely lose its initial ve- 
locity in the very short time 9 through friction. But, at 
the same time, we must assume that the particle gets 
new impulses during this time by some process that is 
the inverse of viscosity, so that it retains a speed which 
on an average is equal to that implied by the Maxwell- 
Boltzmann distribution. But since we must imagine that 
direction and magnitude of these impulses are (approxi- 
mately) independent of the initial velocity of the particle, 
we must conclude that the velocity of the particle will be 
very greatly altered in the extraordinary short time 9, 
and, indeed, in a totally random manner. 

It is clear that Einstein conceived, already in 1907, a 
modified version of Eq. (1), the modification entailing the 
addition, on the right-hand side, of a random force. We 
shall see (in the next section) that his assumptions about 
the randomness of the additional force were echoed by 
Langevin when the latter put pen to paper. 'Let us with 
Einstein consider a colloidal particle,' wrote Ornstein et 
al. 9 in 1927 (without citing a specific Einstein article); 
they presented in their own phrase the above argument 
from Einstein's 1907 note, and wrote down the amended 
equation immediately afterwards. Thus, it seems safe to 
assume that Ornstein was familiar with the note, and 
it was this knowledge that motivated him to credit Ein- 
stein with the germinal idea for converting Eq. (1) into 
a stochastic differential equation. 



2.2. Langevin's 1908 paper 

Langevin accepted the validity of the equipartition 
principle by writing 



RT 



(2) 



where a bar denotes 'an average extended over a large 
number of identical particles', and the other symbols have 
their usual meanings. Using /j, and a instead of Einstein's 
k and P, respectively, he wrote the equation of motion 
in the form 



d 2 x „ dx 
m—r^r = —bn^ia— + X, 



dt 2 



dt 



(3) 



and added immediately afterwards: "About the comple- 
mentary force X, we know that it is indifferently posi- 
tive and negative and that its magnitude is such that it 
maintains the agitation of the particle, which the viscous 
resistance would stop without it" . It is to be noted that 
he says 'we know' while Einstein wrote 'we must assume'. 

Langevin multiplied Eq. (3) by x, and arrived, after 
some elementary manipulations at the following equa- 
tion: 



m d 2 x 2 2 dx 2 

— z mv = SiTua— V Xx. 

2 dt 2 p dt 



(4) 



Next, he took an ensemble average, argued that 'the av- 
erage value of the term Xx is evidently null by reason of 
the irregularity of the complementary forces X\ replaced 
the average mv 2 by its equipartition value, introduced 
the symbol 



z = 



dx 2 
~dT 



dx 2 
~dT' 



and arrived finally at his key equation: 



in dz RT 
--+2*Vaz=—. 



(5) 



(6) 



He was seeking after an expression for z, whose stationary 
limit was to be equated to 2D; this limit, to be denoted 
by Zoo, emerges as soon as one sets dz^/dt = 0. The 
result agrees with Langevin, who took a slightly longer 
route and found 



D 



RT 



(7) 



6irfj,aN 

thereby duplicating Einstein's result. 

2.3. An assessment of Langevin's paper 

It is worth remarking first that Langevin's use of sym- 
bols and his further manipulations are reminiscent of the 
steps choreographed by Clausius in a paper where he de- 
rived the virial theorem. 14 Clausius used the symbol X 
for the deterministic force when he wrote Newton's equa- 
tion of motion for a particle (moving in a non-resistive 
medium), m(d 2 x/dt 2 ) = X, and made use of the follow- 
ing identity: 



d 2 x / dx\ d 2 x 2 
M 2 ~ + Vdt) 



(8) 
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Neither the conception of Eq. (3) nor its transformation 
into a differential equation for x 2 can be called truly orig- 
inal. Be that as it may, it was Langevin who actually 
wrote down Eq. (3) and — regardless of whose brainchild 
it was — he eventually became its eponym. 

My main concern is with Langevin's justification for 
setting xX = 0. Many later authors have simply taken 
his word for the vanishing of this average. The reception 
accorded to his analysis may be summed up by quoting 
from a recent review: 15 "Langevin's derivation, however, 
was spectacularly simple and direct compared to the oth- 
ers (this is probably why it is Langevin's derivation rather 
than Einstein's that is usually found in modern text- 
books)". Among those who have endorsed Langevin's 
logic, one may mention Feynman, 16 Reif, 17 , Heer 18 and 
the authors 19 of a recent treatise devoted to the Langevin 
equation. 

I quote Feynman (who uses the symbol F for X): 

Now what about x times the force? If 
the particle happens to have gone a certain 
distance x, then, since the irregular force is 
completely irregular and does not know where 
the particle started from, the next impulse 
can be in any direction relative to x. If a; is 
positive, there is no reason why the average 
force should also be in that direction. It is 
just as likely to be one way as the other. The 
bombardment forces are not driving it in a 
definite direction. So the average value of x 
times F is zero. [Italics in the original] 

This line of reasoning would lead one to infer, as has 
indeed been done, that vX and xv also vanish. Let us 
deal with vX first. J. D. van der Waals Jr. 20 multiplied 
Eq. (3) by v, took an ensemble average, and set vX = 
to find 



mdv 2 
~2~dT 



= —6irr)av 2 



(9) 



Van der Waals knew that Eq. (9) was unacceptable, since 
it implied that v 2 must decay to zero, contravening the 
equipartition principle. There are two ways to resolve the 
difficulty: one may either scrap the supposition vX = 0, 
or discard Eq. (3) itself. Unfortunately, van der Waals 
made the latter choice, and spent some effort in formu- 
lating his own theory of Brownian movement, using a 
different stochastic equation. Though van der Waals did 
not succeed in winning support for his point of view, he 
contributed indirectly by engaging Ornstein in a fierce 
debate. In the course of rebutting van der Waals, Orn- 
stein and Burger (O&B) had to concede that Eq. (3) is 
an approximation, albeit an excellent one. 7 The limita- 
tions of Langevin's equation, which are easily perceived 
(see below), ought to be emphasized even when it is in- 
troduced in an elementary course. 

To notice the untenability of the assumption xv = 0, 



made by Heer, one only has to note that since 
d x^ 1 

—— = 2 xv (identically), (10) 
the vanishing of xv implies the vanishing of D itself. 



2.4. The treachery of physical intuition 

It seems devilishly difficult to divine, on the basis of 
physical intuition alone, the statistical independence of 
two random variables in a dynamical problem. 21 Let no 
one forget that Maxwell assumed, in his first attempt 
to deduce the law of distribution of molecular velocities, 
that the distribution of the velocity component along 
a particular cartesian axis was independent of the val- 
ues along the other two axes. 22 According to Chapman 
and Cowling, 22 "it would be natural to suppose that 
these components are not independent". Seven years 
later, Maxwell repudiated this assumption, and made 
a second attempt, 23 which too proved to be unsatisfac- 
tory when scrutinized by Boltzmann. After attributing 
the proof presented in their book to Lorentz (who im- 
proved Boltzmann's exposition) Chapman and Cowling 
comment: "This proof also is open to some objection, be- 
cause of the assumption . . . that there is no correlation 
between the velocity and the position of a molecule" . 

Let us return now to Langevin, who accepted that z 
eventually attains a stationary value. After averaging 
Eq. (4), he should have seen that only one conclusion 
was justified: the value of xX is (or becomes, at long 
times) a constant that cannot be smaller than —RT/N. 
Had he allowed for the possibility xX ^ 0, he would have 
found, instead of Eq. (6), 



m dz RT 

__+ 3*^ 

from which one can only deduce that 



D = — 

2 



67r/ia 



RT — 

-w +xX 



(11) 



(12) 



Since Langevin's aim was to validate Eq. (7) , the assump- 
tion xX = reduces his demonstration to a tautology. 

The foregoing arguments amount to little more than 
an elaboration of a footnote in which O&B, who used 
the symbol A for the mean-squared displacement, stated: 
"From the formula (1) [Eq. (3) here] we can deduce the 
relation A 2 = bt if we introduce suppositions, it is how- 
ever impossible to find the value of b, without penetrating 
into the mechanism of the Brownian motion" 7 



2.5. Statistical properties of the random 
acceleration 



I find it hard to evade the conclusion that Langevin 
was tempted into making an error that has dodged de- 
tection on account of its seductive simplicity. Still, I 
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will now adopt the exegesis, offered by O&B as well as 
Manoliu and Kittcl, 24 that Langevin's pithy phrase 'in- 
differently positive and negative' is to be interpreted as 
follows: it implies not only that X = but also a supple- 
mentary relation, concerning the autocorrelation of X, 
from which one can deduce the vanishing of the cross- 
correlation xX. But these authors also noted the ne- 
cessity of proving the vanishing of xX from the second 
relation. It is imperative to do so because, as Mano- 
liu and Kittel point out, 24 physical intuition might lead 
one to conclude that xX ^ 0; after all, an encyclopae- 
dia entry, 25 cited by them, does warn the reader: "The 
mean random force vanishes, but the mean product of 
the random force into the displacement [in three dimen- 
sions] is equal to — 3kT; the frequent assertion that this 
product vanishes is mistaken" . Since proving the relation 
xX entails, even if one takes the short cut devised 
here, a good deal of preparatory work, the route taken by 
Langevin, which has been praised for its brevity and sim- 
plicity, turns into a wearisome trek (from the student's 
point of view), making Einstein's treatment look like a 
jaunt — unless the teacher is prepared to replace the as- 
sertion 'it is self-evident that xX = 0' by the assurance 
'it can be shown that xX = 0'. 

Enough has now been said to convince the reader that 
Langevin's derivation of the mean squared displacement 
must be rectified, or replaced by a cogent alternative. All 
the ingredients for accomplishing the latter task were as- 
sembled by de Haas-Lorentz, but she stopped short, as 
did Langevin before her, of going beyond the stationary 
limit. It was Ornstein who, actually worked out, using 
his own approach, the complete expression in 1917. The 
prospects of adding something novel to this topic are in- 
finitesimal indeed, but a repackaging of the material scat- 
tered in Ornstein's many publications seems worthwhile. 
I wish to present a slightly different and shorter account 
than appears in the 1930 article 26 of Uhlenbeck and Orn- 
stein (U&O). As a preliminary, however, I wish to recall 
some statements made by Ornstein in his first paper. 



3. ORNSTEIN'S 1917 PAPER 

Ornstein began his first paper on Brownian motion by 
citing the works of Smoluchowski and Burger; referring 
to the thesis of de Haas-Lorentz, later published as a 
monograph, 10 he remarked: 

It is worth observing that the way differ- 
ent averages depend on time may be calcu- 
lated from the results obtained by Mrs. de 
Haas-Lorentz by a slightly more careful tran- 
sition of the limit than was necessary for the 
object she had put herself (viz. the determi- 
nation of the stationary condition). First, I 
want to determine these averages by a new 
method . . . 



The relation used by de Haas-Lorentz and mentioned 
by Ornstein has the following form (though they use dif- 
ferent symbols): 



m Tt=- Cv + x ' 



(13) 



which has some advantages over the form chosen by 
Langevin: The use, on the right-hand side, of the sym- 
bol ( rather than 6irna saves some writing, besides ex- 
tending the applicability of the equation to non-spherical 
particles; the introduction of the particle velocity v also 
simplifies the resulting formulas, but the main reason for 
the change is to emphasize that velocity, rather than the 
position, is the quantity of greater interest here. Upon 
dividing Eq. (13) by m, and setting 



m 



and A 



X 
m 



one gets the still neater form 
dv 



dt 



= —av + A. 



(14) 



(15) 



Ornstein carried out a formal integration of Eq. (15) 
to get 



v e 



Jo 



e atl Ait^dh, v o =v{0). (16) 



The fact that A(t\) on the right-hand side is a random 
function did not worry him unduly, since he knew that 
he would attribute physical significance only to averaged 
quantities. It is important to recall at this stage that, 
in the works published during 1917-1930, Ornstein cal- 
culated an average over those molecules which start, at 
time t = 0, with a definite velocity (say vo); I will refer 
to an average of this type as a vo-average, and distin- 
guish it from an overall average (the quantity considered 
by Langevin) by appending the symbol vo to the bar sig- 
nifying a particular average. A wo-average can itself be 
averaged over all initial velocities (distributed according 
to the Maxwell-Boltzmann law) . If one's aim is to obtain 
only the overall average, it often proves more convenient 
to bypass the ZJo-averaging. For the sake of complete- 
ness, let us take note of a mean value that will be needed 
shortly. If we take a wo-average of Eq. (15), we arrive, 
after setting ~A"° = 0, at the equation 



dir* 



dt 



= —av 



whose solution is 



v v « =v e~ at . 



(17) 



(18) 



This is, of course, the same result as would be obtained 
by taking a t>o-averagc of Eq. (16). 

At this point, I would like to part company — at least 
temporarily — with Ornstein and turn to the problem of 
calculating the quantity xA. 
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4. EMENDING THE LANGEVIN ANALYSIS 

Since we are dealing with a system that is in thermal 
equilibrium, we can immediately state 



v = 0, 



RT 

Nm 



(3, say. 



(19) 



Our task is to deduce the statistical properties of A, using 
only Eqs. (15) and (19). If we take an ensemble average 
of Eq. (16), we get, after setting vq = on the right-hand 
side, 



v= f e a *M(ti)dti = 0, 
Jo 



(20) 



from which follows the relation A = 0. 

Averaging the equation found by multiplying Eq. (15) 
by 2v gives 



d v 2 

— V — = -20V 2 + 2vA = 0, 
at 



(21) 



which shows, after it is combined with Eq. (19), that 

vA = a/3. (22) 

On the other hand, multiplication of Eq. (16) by A(t) 
and averaging gives 



Av = e- at f e atl A(t)A(h)dh. 
Jo 



(23) 



On equating the right-hand sides of the last two equations 
one finds 



a(3 = e~ at [ e^A^Aih jdh 
Jo 



(24) 



a relation that can only hold if A(t)A(t\) possesses the 
two properties expressed below; in the first place 

fO, litx^t 



A{t)A{t{) = I (25a) 
I oo if ii = i, 



and secondly 



A(t)A(ti)dt-i = a/3. 



(25b) 



We have now completed the preliminaries for finding Ax. 
Integration of Eq. (15) from to t provides a relation 
connecting x, v, and A: 



v(t) -v = -a[x(t) - x(0)] + [ A(t 1 )dt 1 

Jo 



(26) 



We now multiply Eq. (26) by A{t) and take an average 
to find, after some obvious rearrangement, 



Ax = — 

a 



A(t)A(t 1 )dt 1 - vA 



0. 



(27) 



The second equality follows from the fact that each term 
within the square brackets equals a(3. After this demon- 
stration, we can justifiably retrace the steps that took 
Langevin from Eq. (4) to Eq. (6), and integrate our ver- 
sion of Eq. (6) to get 



dxv 
~df 



= —a xv + /?, 



(28) 



which can itself integrated to obtain 



— & h 
xv = — ( 1 

a v 



*)■ 



(29) 



Since dx 2 jdt = 2xv , we finally get, with xq = x(0), 



2 2/3 
x + — 
a 



t 



(30) 



a relation that Langevin could have derived but chose 
not to do so. He was reluctant, as was de Haas-Lorentz 
a few years later, to go beyond the expression derived by 
Einstein. Needless to say, this result was first obtained 
by Ornstein 5 by averaging his i>o-average x 2 over a 
Maxwellian distribution of initial velocities. 



4.1. A closer look at correlations 

We see from Eq. (29) that the correlation of x and v, 
assumed to be zero initially, rises exponentially (with a 
time constant 1/a) to its stationary value. On the other 
hand, the average vA turned out to be a constant; this is 
simply because we have assumed the corresponding time 
constant to be zero. It is tempting to tinker with the 
autocorrelation of A and assign it a small but non-zero 
decay time by setting 



A(ti)A(t 2 ) - CXer^ 2 -^, C= constant. (31) 

Should we change our mind, we can always recover the 
results previously obtained by letting A — > oo and assign- 
ing C an appropriate value. Substituting the right-hand 
side of the last equation into Eq. (23) and integrating we 
get 



vA = 



CX 
\ + a 



1 _ e -(A+at) 



(32) 



Though this result looks appealing, it should be rejected, 
when dealing with a system that is in thermal equilib- 
rium, because a time-dependent value of vA violates the 
cquipartition principle: according to Eq. (21), vA must 
be a constant, and this constancy is incom patible with 
any choice that makes the autocorrelation A(t\)A(t2) 
decay at a finite rate. Ornstein and van Wijk (O&vW), 
who dealt with overall averages rather than uo-averages, 
emphasized that the statistical properties of A are deter- 
mined as soon as one writes down Eqs. (15) and assumes 
that the system is in thermal equilibrium. Manoliu and 
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Kittel overlooked this restriction when they replaced the 
white spectrum of A by a Lorentzian; 24 it will be pointed 
out presently that the spectrum of A can be made col- 
ored (nonwhitc) only by modifying the equation of mo- 
tion itself. The introduction of Eq. (31) did not lead 
to an immediate generalization of our treatment, but it 
can still serve as a crutch for those students who arc 
unable to jump from Eq. (24) to Eq. (25), or feel uncom- 
fortable with an alternative (given below) involving the 
Dirac delta function; in the next section, it will serve as 
a part of a scheme devised for generalizing Eq. (15). 



4.2. A critique of the Langevin equation itself 

Let us set m — 1 so that we may speak of the force 
rather than acceleration, and let w(t) = —av + A de- 
note the total force acting on the particle. O&B agreed 
with van der Waals that for t = 0, w(t) = 0. Besides 
it is evident, they remarked, that for t infinite the av- 
erage value of w(t) undergoes no influence from vo and 
therefore must be zero. They noted that the course of 
w(t) = must be such that it starts from zero, attains 
a maximum (or minimum) and then falls to zero again. 
On the other hand, Eqs. (15) and (18) show that 

W v ° = -av v " + A v ° = -ae- at v . (33) 




12 3 4 

t/a 



Fig. 1. Plots showing the time dependence of the average 
force acting on a Brownian particle that started with a ve- 
locity Vo > 0. The solid curve depicts the idealized behavior 
predicted by Eq. (15), whereas the dotted curve is obtained 
by plotting the right-hand side of Eq. (34) with A = 10a and 
C= 12. 

Assuming that v n > 0, they drew two curves, one of 
which, labelled by them as 'Einstein', is a plot of the 
right-hand side of Eq. (33), and the other (which they 
call the true curve) is a schematic plot of the behaviour 
described above. I have taken the liberty of converting 
their qualitative, freehand figure to a quantitative plot, 



by using the following expression 

/ -At _ e -at\ 

w v = c V ^ iq = constant ). (34) 

A — a 

The reasons for choosing the above form will be explained 
shortly; at present, it is enough to note that the resulting 
curve resembles that drawn by O&B, and to proceed by 
quoting their comments on the shapes of the curves: 

For t — the line, which represents this 
course deviates from the true curve. The 
important agreement existing between Ein- 
stein's theory and the experiment now makes 
us presume, that the true w(t)-t curve and 
the curve according to Einstein only deviate 
from each other for short times after the de- 
parture of the particle with the velocity vq 
so that the maximum in the true curve lies 
close to t = 0, and that from this maximum 
onward it descends pretty well exponentially 
according to Einstein's curve. It goes with- 
out saying that these are only assumptions, 
which a calculation of the true w(t) curve 
must prove from the molecular theory. We 
are however of the opinion that it is worth- 
while to point to this possible interpretation 
of Einstein's master-stroke in the theory of 
the Brownian motion. 

It is remarkable that O&B recognized the limitations 
of Eq. (15) as far back as 1918, and even pointed out the 
nature of the refinment that was needed. A generaliza- 
tion of Langevin's equation in the form 

± = -£^a(t-t')v(t') + A(t). (35) 

came almost fifty years later. 27 It can be shown that 
a(t— t') is proportional to the autocorrelation of A, which 
explains the choice made in Eq. (34): if we take a vq- 
average of Eq. (35), and use a zero-order approximation 
W" a = v cxp(— at) on the right-hand side, and apply 
Eq. (31), the integral would equal (apart from a multi- 
plicative constant) the right-hand side of Eq. (34). 



5. A RANDOM WALK IN VELOCITY SPACE 

Both Langevin and Ornstein began with two claims: 
each stated that though the desired result (s) could be 
found by a re-working of an existing analysis, each would 
rather present a derivation of his own. Ornstein men- 
tioned that the time dependent averages can be calcu- 
lated from the results obtained by de Haas-Lorentz, 10 
who credited the gist of the method used by her to 
a 1910 article (not concerned with Brownian motion) 
by Einstein and Hopf. 28 She integrated Eq. (15) from 
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t = (N — l)r to t = (N — 1)t, thereby converting it into 
a difference equation: 



vn - vn-i 



-cuvn-it + In-i, 



where 



'N-l 



i-Nt 

J(N-1)t 



(36a) 



(36b) 



The time interval r is so short that v does not change 
substantially during this interval, but the fluctuations 
in A arc supposed to be extremely rapid even on this 
short time scale. With this background, we can already 
state two statistical properties of the random term on the 
right-hand side of Eq. (36a): 



7T° = o 



ijh — <%n 1; s i: 



1, Hj = k 
0, otherwise. 



(37) 



(38) 



We have arrived at Eq. (36a) by discretizing Eq. (15), 
but an alternative point of view, adopted by Gunther and 
Weaver, 29 is worth a mention: One can regard Eq. (36a) 
as the basic equation in its own right, Io, h, ••• being 
a sequence of mutually independent random variables; 
a great virtue of Eq. (36a) is its amenability to Monte 
Carlo simulation. 29 ' 30 Regardless of one's interpretation, 
Eq. (36a) can be immediately understood as describing a 
random walk in velocity space, and its solution requires 
only summation of geometric series, as demonstrated first 
by de Haas-Lorentz; her contribution has been underes- 
timated, partly because she herself called it the method 
of Einstein and Hopf, but mainly because she did not 
extract the full time-dependence from her expressions. 

It follows from iteration of Eq. (36a) that vn can be ar- 
ranged as a sum. Since this section is supposed to provide 
a discrete variant of Ornstein's approach, we will express 
the sum so as to highlight the correspondence between 
Eq. (16) and its discrete counterpart shown below: 



v N 



v p + P 



N 

fe=l 



h-iP 



where 



p = (1 — err). 



(39) 



(40) 



Let us now take a vo-average of Eq. (39), and arrive, after 
using Eq. (37), at the relation 



VN 



vop 



N 



(41) 



We would expect, on letting TV — > oo and r — > 0, to re- 
cover the results found by Ornstein if we make the iden- 
tifications Nt — t, so that vn — v(t), xn — x(t), and so 



on. Since we are going to let r 
write 



0, it is permissible to 



or 



-at 



(42) 



We have now established that, as r — > 0, one gets 

W v ° = v e- at , (43) 



a result that coincides with that stated in Eq. (18). 

Upon squaring-and-averaging, Eq. (39), and making 
use of Eqs. (37) and (38), we are left with 



N-l 



vlp^ + p^U^p- 2 " 
~P 2 



»oV" + ni 



1 



(44) 



As we let r — > 0, the first term on the right-hand side 
goes into Wpe^ 2 "*, whereas the factor within the square 
brackets will approach 

1 



2ar 



(1-e 



-2at\ 



Since v 2 must tend to (3 as t — ; 
limiting value 

lim — = 2aB, 
and arrive thereby at the result 



oo, we choose the 



(45) 



v n e 



-2at 



+ 0(1 



-2at 



(46) 



Gunther and 
as well as a; 2 ° , 



derived first by Ornstein. 

There is no need to go further since 
Weaver 29 have shown how to obtain v 2 
but two comments are still in order. First, they were 
apparently unaware that the bulk of the calculation had 
already been done by de Haas-Lorentz; secondly, they ar- 
rive at a formula which may be written (when allowance 
is made for the difference in notation and a typographical 
error in their paper) as follows: 



(X - Xq) 2 



v 



%1 



+ 



20 



e -ot)2 

3 - Ae- at 



t - 



+ e 



-2at 



2a 



(47) 



At this stage, they only point out that if one sets v 2 = /?, 
the expression for the mean-squared displacement given 
in Eq. (47) reduces to that 'obtained using the classical 
Langevin equation', namely that given in Eq. (30). Of 
course, both expressions pertain to the Langevin equa- 
tion, and their difference can be traced to different aver- 
aging procedures. 

Continuing in the same vein, one can recover all the 
mean values calculated by U&O; a verification of this 
statement will not be offered in these pages. 



8 



6. THE DELTA FUNCTION 



by the properties 



Ornstein's handling of integrals involving the autocor- 
relation A(t 2 )A(t\) reveals an uncanny prescience of 
an impulse function (some ten years before Dirac 31 in- 
troduced the delta function). Readers who do not have 
easy access to Ornstein's 1917 article may examine the 
two 1927 articles, where the original calculations are re- 
peated. In all these papers, Ornstein encounters the au- 
tocorrelation when he calculates v 2 ; by squaring both 
sides of Eq. (16) and performing a uo-average he finds 



-2at 



+ 



ft ft 

/ dhe at i / e at M(ti)^(t 2 )*°<ft2. (48) 
Jo Jo 

At this point he argues that A(ti)A(t 2 ) ° must be differ- 
ent from zero, since v 2 ° for t = oo is positive, but it is 
clear that the correlation can be appreciable only when 
the difference between t\ and t 2 is small. He introduces 
r = t 2 — ti as a new variable and proceeds as follows: 



f dhe atl f e at2 A(t 1 )A(t 2 ) V °dt 2 = 
Jo Jo 

f 

Jo 



dti e 



Ait^Aih+r) dr, 



(49) 



adding that the replacement of t\ + 1 2 in the exponent by 
2t 1 is justified because only the region around r = con- 
tributes to the second integral. Finally, he states that it 
is easily seen that the second integral is independent of t\ 
and t 2 , and is a characteristic constant for the problem at 
hand, and may be denoted by a\. To a modern reader, it 
shoul d be evident that Ornstein identified the autocorre- 
lation A(ti)A(t 2 ) V ° with aiS(ti -t 2 ). In the 1917 article, 
he also evaluated v(t)A(t) through the same reasoning 
and found that its value to be a\/2. It will be instructive 
to repeat this calculation using delta calculus. We have 



f e atl v(t)A(t 
Jo 



i) dti 



I 



at I at i 



a i 



e Qtl ai(5(i-ti)dti = (50) 



since 



5{x)dx 



5{x)dx 



(51) 



Ornstein never used the name delta function but he did 
use, in two joint publications with Uhlenbeck, the symbol 
S. In their 1930 article, 26 U&O mention that 



4nDt 



1/2 



^-{x-xoY /ADt 



is that solution of the diffusion equation which "for t = 0, 
becomes 5(x — xq), when S(x) means the function defined 



S(x) = for x^Q 
S(x)dx = 1". 



Seven years later, Ornstein and Uhlenbeck 32 used the 
symbol S(E — e) for expressing the condition "that at t = 
all particles have the energy E," and added u S(E — e) is 
the well-known singular peak function" . However neither 
U&O nor O&vW chose to take advantage of the delta 
calculus, and preferred to carry out their manipulations 
using the pre-Dirac manoeuvres pioneered by Ornstein; 
in these two papers, the authors change both variables 
by setting u = (t\ + t 2 ) and w = {t\ — t 2 ). I will not 
speculate about their preference, but will now raise, and 
later answer, a different question: Why has Ornstein not 
been credited with the invention of the delta calculus? 

The answer is not far to seek. The second section of 
the 1927 paper of Dirac is devoted to notation, where he 
states: 

One cannot go far in the development of 
the theory of . . . without needing a notation 
for that function of ... x that is equal to zero 
except when x is small, and whose integral 
through a range that contains the point x = 
is equal to unity. We shall use the symbol 
6(x) to denote this function, i.e., S(x) is de- 
fined by 



S(x) = when x ^ 0, 



and 



/oo 
dx 5(x) = 1. 
-oo 

Strictly, of course, S(x) is not a proper func- 
tion of x, but can be regarded only as a limit 
of a certain sequence of functions. All the 
same one can use S(x) as though it were a 
proper function for practically all the pur- 
poses of . . . without getting incorrect results. 
One can also use the differential coefficients of 
6(x), namely S'(x), 6"(x) . . ., which are even 
more discontinuous and less "proper" than 
S(x) itself. 

A few elementary properties of these func- 
tions will now be given so as not to interrupt 
the argument later. We can obviously take 

S(x) = 5(-x), 5'{x) = -S'(-x), etc If 

f(x) is any regular function . . ., we have 



I 

J —i 



f(x)5(a - x)dx = f(a), 



so that the operation of multiplying by 6 (a — 
x) and integrating with respect to x is equiv- 
alent to the operation of substituting a for 



x. 
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What we now call Dirac's delta function might well 
have been named after Ornstcin, if the latter had inter- 
rupted his argument to introduce, with appropriate fan- 
fare, a new function, spelling out its principal properties 
once and for all, and giving it a smart name. 



7. THE NAMING OF THE PROCESS AND THE 
PROCESS OF NAMING 

With the exception of two papers, where Ornstein was 
the only author, and a third, where he had more than one 
coauthor, each of his other papers in the period 1917-33 
was a joint effort with a single coauthor: Burger, Uhlen- 
beck, van Wijk and Zernikc. Though Uhlcnbeck is the 
first author of the 1930 article, the random process in- 
vestigated in the article is now called Ornstein- Uhlenbeck 
process. The person responsible for this reversal of or- 
der is, as far as I can see, Doob, 33 who must have been 
under the impression that Ornstein was the first author 
(as can be verified by examining his list of references); 
since he refers, throughout his paper, to 'Ornstein and 
Uhlcnbeck' it stands to reason that he introduced the 
name 'O.U. process'; subsequent authors have chosen to 
replace the initials by complete names without reversing 
their order. Since singling out Uhlenbeck among all the 
coauthors named above seems (to me) unfair, and includ- 
ing them all is clearly impractical, the choice 'Ornstein 
process' would have been more equitable, but the name 
'Ornstein-Uhlenbeck process is here to stay. 

Ornstein died in 1941. 34 Four years later, Ming Chen 
Wang and Uhlenbeck published their celebrated review 
article. 35 By now, they were unequivocal in calling 
Eq. (15) the Langevin equation; they also used the ap- 
pellation 'the Dirac delta function' and used it for spec- 
ifying the temporal behavior of the autocorrelation of A 
by stating the relation 



A(t 1 )A(t 2 ) = 2a/35(t 1 -t 2 ). 
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